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ABSTRACT 

We measured the angular clustering at z ~ 6 from a large sample of 1775 dropout galaxies (293 
with Z85o<27.5 from GOODS and 95 with £850<29.0 from the UDF). Our largest and most complete 
subsample (having L > 0.5L* =6 ) shows the presence of clustering at 94% significance. For this sample 
we derive a (co-moving) correlation length of ro = 4.5^3 \ Mpc and bias b — 4.1+26, using an 
accurate model for the redshift distribution. No clustering could be detected in the much deeper but 
significantly smaller UDF, yielding b < 4.4 (lcr). We compare our findings to Lyman break galaxies 
at z ~ 3 — 5 at a fixed luminosity. Our best estimate of the bias parameter implies that ^775 dropouts 
are hosted by dark matter halos having masses of ~ 10 11 Mq, similar to that of V606 dropouts at 
z ~ 5. We evaluate a recent claim that at z > 5 star formation might have occurred more efficiently 
compared to that at z = 3 — 4. This may provide an explanation for the very mild evolution observed 
in the UV luminosity density between z = 6 and 3. Although our results are consistent with such a 
scenario, the errors are too large to find conclusive evidence for this. 

Subject headings: cosmology: observations - early universe - large-scale structure of universe 



1. INTRODUCTION 



Th e Advanced Camera for Surveys (ACS; IFord et all 
1998) aboard the Hubble Space Telescope has made 
the detection of star-forming galaxies at z ~ 6 
(2775 dropouts) relatively easy. Th e largest sam- 
ple o f 1775 dropouts currently available l)Bouwens et al.l 
2006) comes from the Great Observatorie s Origins Deep 
Survey (GOODS: IGiavalisco et"all 12004ft . allowing the 
first quantitative a nalysis of galaxies only 0.9 Gyr af- 



ter r e combination ( Stanwa v et a 
20031 lYan fc Windhorst! 12004 



1120031 IBouwens et al l 



D ickinson et al.l 



Malhotra et aU 120051 see also iShimasaku et alJ 120051 
Ouchi et al.l l2005bft . IBouwens et all <|2006ft found evi- 



dence for strong evolution of the luminosity function be- 
tween z ~ 6 and 3, while the (unextincted) luminosity 
density at z ~ 6 is only ~ 0.8 times lower than that 
at z ~ 3. Some 2775 dropouts have significant Balmcr 
breaks, indicative of stellar populations older than 100 
Myr a nd masses comparable to those of L* galaxies at 
z w l|Evles et al.ll2005t lYan et alJl2005ft . 

Through the study of the clustering we can address 
fundamental cosmological issues that cannot be an- 
swered from the study of galaxy light alone. The 
strength of clustering and its evolution with redshift 
allows us to relate galaxies with the underlying dark 
matter and study the bias. The two-point angu- 
lar correlation function (ACF) has been used to mea- 
sure the clustering of Lyman break gala xies ( LBGs) 
at z = 3 - 5 (e .g. lAdelberger et all 119981 120051 
[ Arnquts etaD 119991 l2002t [Magliocchetti fc Maddoxl 
Il999t IGiavalisco fc Dickinson! 120011 lOuchi et alJ 120011 
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20041 iPorciani & Giavaliscol 120021 iHildebrandt et all 
20051 1 Allen et al J 120051 IKashikawa et al J 12006ft . LBGs 
are highly biased (b ~ 2 — 8), and this biasing de- 
pends strongly on rest frame UV luminosity and, to a 
lesser extent, on dust and redshift. The clustering statis- 
tics of LBGs have reached the level of sophistication 
that one can measure two physically different contribu- 
tions. At small angular scales the ACF is dominated 
by the non-linear clustering of galaxies within single 
dark matter halos, whereas at large scales its amplitude 
tends to the "clas sical" clustering of galaxies residin g 
in different halos ijOuchi et al.M2005ai ILee et all 12006ft . 
as explained within t he framework of the halo occupa- 
tion d istribution fe.g.. lZehavi et al.ll2004l iHamana et alJ 
12004ft . Understanding the clustering properties of galax- 
ies at z ~ 6 is important for the interpretation of 
"overdensities " observed towards l uminous quasars and 
in the field llOuchi et all l2005bl IStiavelli et all 120051 
IWang et aLll2005l IZheng et al.ll2006ft that could demar- 
cate structures th at preceded present-d ay massive galax- 
ies and clusters l)Springel et al.ll2005|) . Our aim here 
is to "complete" the census of clustering by extending 
it to the highest redshift regime with sizeable samples. 
In §§ 2 and 3 we describe the sample, and present our 
measurements of the ACF. In § 4 we discuss our find- 
ings. Throughout we use the cosmology (CIm, ^a, ^72, 
n, cr 8 )=(0.27,0.73, 1.0, 1.0,0.9) with H a = 72 h 72 km s" 1 
Mpc" 1 . 

2. DATA 

The present analysis is based on the sample of ^77 5 
dropouts described in detail by IBouwens et al.l f 2006) . 
We used the ACS data from the GOODS vl.O release, 
consisting of two spatially disjoint, ~ 16 arcmin 2 fields. 
Thes e data were processed with Apsis fBlakesle e et alJ 
2003), along with a substantial amount of overlapping 
data available from the Galaxy Evolution from Morphol- 
ogy a nd Spectral energy distributions (GEMS: |Rix et all 
l200l . supernova searches (A. G. Riess et al. 2006 and S. 
Pcrlmutter et al. 2006, both in preparation), and Ultra 
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Fig. 1. — Redshift distributions of 2775 dropouts in our GOODS 
(left) and UDF (right) selections (estimated by projecting a com- 
plete UDF -B435 dropout sample scaled to the siz es and colors as 
found for the 1775 dropout sample to z ~ 5 — 7; see lBouwens et alJ 
2006, for details). As a result of a more significant photomet- 
ric scatter in 2775— 2850 1 th e selection extends to lower redshifts in 
GOODS than it does for the UDF. 



Deep Field (UDF) NICMOS programs (|Thomnson et all 
2005). The processed images were brought to a uni- 
form signal-to-noise level by degrading the deeper parts 
of the area. The 10er detection limit of the degraded data 
was 27.5 in z 850 in a 0'.'2 diameter aperture. We also 
used a deep sample of 177* dropouts selected from the 
UDF l|Beckwith et, alJl200qi . covering one ACS pointing 
of ~ 11 arcmin 2 with a 10<r detection limit of 29.2. 

Objects were selected by requiring 1775-2350 > 1.3, and 
V606~-Z850>2.8 or a non-detection (2a) in Veo6 to exclude 
lower redshift interlopers. Point sources were removed 
based on high stellarity parameters >0.75. The esti- 
mated residual contamination due to photometric scat- 
ter, red interlopers, and stars is —7% to zs5n=28.0 , of 
which 2% is due to stars (see iBouwens et al] 120061 for 
details). The effective redshift distributions for GOODS 
and the UDF are shown in Figured] The effective rest- 
frame UV luminosity of the sa mple is L w 0.5L* =6 for 
zg5o^27.5 l|Bouwens et a l. 2006). Note that the luminos- 
ity is quite sensitive to redshift due to the Gunn-Peterson 
trough entering z 8 so at z > 6, with I/* =6 corresponding 
to z 850 ~26.5 (-28) at z = 5.5 (z = 6.5). 

3. THE ANGULAR CORRELATION FUNCTION 

We measured the ACF, w(6), defined as the excess 
probability of finding two sources in the solid angles 
6Q1 and 8Q2 separated by the angle 9, over that ex- 
pecte d for a random Poissonian distribution ijPeeblesI 
1980). We used t he estimator w(9) = \DD (9)-2DR(9) + 
RR(6))/RR(0) of lLandv fc Szalavl jf9931. where DD(6), 
DR(6) and RR(9) are the number of pairs of sources 
with angular separations between 9 and 9+A9 measured 
in the data, random, and data-random cross catalogs, re- 
spectively We used 16 random catalogs containing — 100 
times more sources than in the data, but with a similar 
angular geometry . Errors on w(9) were bootstrapped 
l|Ling et alJ ll986). We assumed a power-law ACF of the 
form w(9)=A w 9~ /3 and determined its amplitude, A w , 
by fitting the function w(9)=A w 9~P —IC . The integral 
constraint [IC= J J w(9)dflidfl2 / , where f2 is the sur- 
vey area] was 0.033A W for GOODS and 0.074A lu for the 



UDF. We did not attempt to fit the slope of the ACF 
and assumed /3 = 0.6 based on the results of iLee et alJ 
(2006). The ACF was fitted over the range 10"-300" 
(10"-200" for the UDF), corresponding to roughly 0.4- 
10 Mpc comoving at z — 6. The lower value of 
10" is larger than the virial radius of a 10 12 Mq halo to 
ensure that we are measuring the large-scale clustering 
(and not receiving a contribution at small scales from the 
subhalo component). Because the results of the fits are 
sensitive to the size of the bins used, we determined A w 
using Monte Carlo simulations of the data. Finally, we 
note that if the contaminants to our samples (~7% of the 
total) have a uniform distribution, the measured ampli- 
tude should be multiplied by —1.16 to yield the corrected 
clustering amplitude. 

3.1. Results from GOODS and the UDF 

Figure H (top panels) shows the GOODS and UDF 
ACFs for various limiting magnitudes. The results of 
the fits are given in Tabled For the three brightest sub- 
samples (z850<28.5) we measured a positive signal out 
to 9 ~ 1'. In GOODS, we found A w = 2.71 ± 2.05 
for z 850 <27.0, and A w w 0.80 ± 0.69 for z 850 <27.5. 
The analysis of the UDF is hampered by the relatively 
small number of sources available, owing to its ~ 30 
times smaller area, although its greater depth (1.5 mag) 
partially makes up for this lack of area. We found 
A w = 1.40 ±1.64 and A w = 0.00 ±0.93 for the z 850 <28.5 
and Z850<29.0 samples, respectively. 

Because the objects were selected from data of uniform 
depth, signal in the ACF is unlikely to be caused by varia- 
tions in the object surface density. Given the large errors 
on A w , it is useful to ask whether the w(9) observed at 
9 < V could be the result of shot noise in a random ob- 
ject distribution. We created 1000 random distributions 
with the same geometry and the same number of points 
as our GOODS and UDF data, and calculated the ACF 
in each of the random samples. The mean and standard 
deviation at each 9 is plotted in Figure |5] (top panels, 
offset by -0.4 for clarity). We calculate the chance of 
reproducing the observed clustering in the random real- 
izations, using the average w(9) measured over the first 
four bins (9 < 100") as a gauge of this clustering. This 
chance is 0.1% for our z 8 5u<27.0 sample, and 6%, 10% 
and 35% for the fainter samples. 

Another test of the clustering signal detected in this 
sample was as follows. We used the formalism of 
iSoneira fc Peebles! l)1978|) to create mock samples with a 
choice ACF in two dimensions. A 250' x 250' mock field 
with surface density similar to that of the 1775 dropouts 
allowed us to mimic the measured A w to an accuracy 
of 98%, determined from a fit. Next, we randomly ex- 
tracted 100 mock "GOODS" surveys and measured the 
mean w(9) and its standard deviation using identical bin- 
ning and fitting to that for the real samples. The result is 
indicated in Figure [5] (hatched region) for our z 8 so<27.5 
sample, which being our largest and most complete sam- 
ple provides the most reliable constraint on this cluster- 
ing. The simulation demonstrates that the amplitude of 
the observed w(9) at 9 < 1' lies within < la of the ampli- 
tudes predicted based on our model ACF, although the 
scatter in the expected amplitudes is large. 

In the above analysis we restricted ourselves to cluster- 
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Fig. 2. — Top panels show the measurements (points) and the best-fit ACF (solid line) for the 1775 dropout samples extracted from 
GOODS and the UDF. The slope was kept fixed at j3 = 0.6. Shot noise expectations have been indicated by empty error bars (offset 
by -0.4 in the vertical d irection for clarity). The middle and bottom panels show, respectively, the (non)linear clustering of dark matter 
IPeaco ck & Dodds 1996) and the bias at each 9 (points) and for the best-fit ACF (solid line). The hatched region indicates the lc range 
of w(6) expected based on a simulated distribution with the same clustering amplitude as measured for the 2850 <27. 5 sample. 



TABLE 1 

ACF AND RELATED PHYSICAL QUANTITIES. 



2 8 50 Area a A w r 

(mag) N(<z) (arcmin ) (arcsec' 3 ) (/if/ Mpc) b (30") 

Enhanced 13 GOODS Data 

27.0 172 2 x 160 2.71 ± 2.05 9.6+g g 

27.5 293 2 x 160 0.80 ± 0.69 4.5±| a 

UDF Data 

28.5 52 1 x 11 1.40 ± 1.64 < 10.2 <TH) 

29.0 95 lxll 0.00 ±0.93 < 4.8 < 4.4 

a Approximate areal c overage that meets o ur S/N requirements for 
*775 dropout selection IBouwens et 
b See § 2 for details. 



ing at 9>10". Our measurements also showed an excess 
of pair counts at 6<10". Upon closer inspection it was 
found that the excess was strictly limited to 8<5", with 
iu(2'.'5)^2.0 ± 0.9. The excess is consistent with an en- 
hancement of w(9) due to subhalo clustering at < 30 
kpc to 1.7cr confidence, but the exact amplitude cannot 
be determined accurately due to the small number of 
pairs (rr pairs at zg5o<28.0). The ex cess is similar to 
that found for Lya emitters at z = 5.7 l)Shimasaku et aLl 
2006). While it is possible that the positive signal out 
to ~ V is the result of strong sub halo clustering (see 
ILee et al.ll2006l IQuchi et al] I2005bl) . the occurrence of 
such halos becomes increasingly rare with redshift, and 
by limiting the fits to 9 >10" we minimized any contri- 
bution. 

4. DERIVATION OF COSMOLOGICAL QUANTITIES 

Although the uncertainties are large, we estimate the 
spatial correlation length (tq) from A w , using the Lim- 
ber equation adopted for our cosmology and the redshift 
distributions of Figure □ (see Table UJ). The clustering 
was assumed to be fixed in comoving coordinates across 
the redshift range. We found r =4.5±f j Mpc for 
the 2850<27.5 sample. At Z850<27, the best-fit value was 
found to be twice as high, ro=9.6^5g Mpc, but con- 
sistent with the fainter subsample within the errors. For 
the UDF samples, the best-fit values correspond to upper 



limits for the clustering amplitude of ~ 10 and ~ 5 h^ 2 
Mpc for z 8 50<28.5 and Z850<29, respectively. If we apply 
the contamination correction, tq increases by ~10%. 

We calculated the galaxy-dark matter bias, defined 
as b(9) = ^/w(9) /w dm (9) , where w dm (9) is the ACF of 
the dark matter as "seen" through our redshift window; 
Wdm (9) was calculated us i ng the nonlinear fitting func- 
tion of IPeacock fc Doddsl l)1996l) (Fig. middle pan- 
els) . In the bottom panels of Figure [21 we have indi- 
cated the bias as a function of 9 (points). Our best-fit 
ACF at z 850 <27.5 implies &(0~3O")=4.1±^ (solid line), 
bracketed by b ~ 8 for the brightest GOODS sample and 
b < 4.4 for the faintest UDF sample. The contamination 
correction yields values that are ^5% higher. 

It is important to evaluate how our results might be 
influen ced by cosmic variance. Using ISomerville et al J 
(2004), we estimate o-„~0.2 for GOODS and a v ~0.5 for 
the UDF (o~ v being the square root of the cosmic vari- 
ance). Our best constraint on clustering at z = 6 is 
therefore currently provided by the Z85o<27.5 sample, 
given the relatively small variance, large sample size, and 
large completeness. Our best value for the bias of 1775 
dropouts is very similar to the bias of b = 3.4 ± 1.8 found 
for fai nt Lyq emitters in t he Subaru/XMAf-A^ewioriDeep 
Field ijOuchi et al.ll2005b|) . An alternative method of es- 
timating the bias is to directly compare the 1775 dropout 
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Fi g. 3. — Bias of 1 775 dropouts compared to bias at z = 3 — 5 
from Lcc ct al. ( 2006). Dashed lines indicate the bias of dark halos 
from ISheth & Tormed ft999t) for M halo > 10 10 , 5 x 10 10 , 10 11 , 
5 X 10 11 , and 10 12 Mq (bottom to top). The relatively small halo 
mass inferred at z = 5 compared to that at z = 4 for objects with 
M z < —20 (squares) cannot be confirmed at z = 6 based on the 
present data. The best-fit halo mass inferred for objects at 2 = 6 
is consistent with the average halo mass of Vbo6 dropouts at z = 5 
at a fixed luminosity of M z < —19.5 (ci rcles). The star indi cates 
the bias of Ly« emitters at z = 5.7 from Ouchi ct al. 1 2005b). 



number density to the number density of dark halos at 
z = 6. Assuming that the bias of the 1775 dropouts cor- 
responds to that of dark h alos more massive tha n the 
average halo hosting th e m iSheth fc Tormenlll999|) . fol- 
lowing [Some^iII^On <|2004() we predict that the aver- 
age bias ranges from 6«5.5tofeRi3.8 for samples with 
limiting magnitudes from Z85o=27 to zs50=29, with ^5% 
errors in these estimates due to the uncertainty in num- 
ber density caused by cosmic variance. These values are 
largely in agreement with our measurements. The model 
predictions furthermore suggest that our best-fit value 
measured for the brightest GOODS sample (zs50<27.0) 
is likely spuriously high, given that the number density 
changes by not more than a factor of 2 over half a mag- 
nitude, giving only a modest increase in the bias com- 
pared to the Z85o<27.5 sample. This GOODS sample 
thus adds very little additional constraints to the clus- 
tering at z = 6. The UDF samples suffer from relatively 
small number statistics, as well as large cosmic variance. 
The clustering signal in the UDF is likely further dimin- 
ished due to the strong luminosity de pendence of clus- 
tering as seen at lower redshift (e.g., iKashikawa et al.1 



I n Figure we compare our results to the work 
of iLee et al.l 1(2XM\ . who found 6«3.3±0.5 for faint 
Ve>oe dropouts (z ~ 5) also selected from GOODS. At 
zg5o^27.5 we probe approximately the same rest-frame 
luminosity as their faintest (i.e., Z850<27) Vqog dropout 
sample (M z < —19.5). To this limit, we measure a bias of 
b = 4.1+2'g, suggesting an average halo mass of ~ 10 11 
Mq, comparable to the average mass of halos hosting 
Vqoq dropouts. 

Interestingly, ILee et all l|200fit) found that at slightly 
higher rest-frame luminosities [M z < —20), the cluster- 
ing of V(306 dropouts is weaker than that of U and -B435 
dropouts at z = 3 — 4. It is hence inferred that the 
halo mass at z = 3 — 4 is ~ 10 times larg er (~10 12 
M(p) c ompared to that at z = 5 (see Fig. 3). ILee et all 
(2006) argued that star formation occurred more effi- 
ciently at higher redshifts (z ~ 5) than it did at z ~ 3 — 4, 
given that objects of comparable luminosity are found 
in less massive halos at z ~ 5. Unfortunately, this re- 
sult cannot be confirmed at z ~ 6 using our brightest 
(M z < —20) GOODS sample, given the large uncertain- 
ties in the bias and the associated halo mass. Also, the 
decrease in the effective halo mass from z = 4 to 5 at 
M z < —19.5 is not as dramatic as observed at luminosi- 
ties of M z < —20, making it difficult to verify this re- 
sult based on the present 1775 dropout sample. However, 
we note that if a decrease in the star-forming efficiency 
with decreasing redshift is true (and can be confirmed for 
galaxies at z > 6), it would largely offset changes that 
are occurring in the mass function over this range. As 
such, this may provide at least a partial explanation for 
the mild evolution in the luminosity density from z = 6 
to 3. 

In conclusion, we used the largest available sample of 
*775 dropouts to study clustering at z ~ 6. We found 
a small signal, although its amplitude is not well con- 
strained due to the large errors on the individual data- 
points. The present analysis is therefore reminiscent of 
that performed at z ~ 3 — 5 based on the original Hub- 
ble Deep Fields. The clustering of galaxies at z ~ 6 
wil l continue to be studied from deep, wide surveys (e.g., 
see lOuchi et al.l2005bl:IShimasaku et alJ2005l 12006]) . Al- 
though it might become possible in the near future to 
increase the size of our faint ACS samples by relaxing 
our current 1775 dropout detection threshold, to perform 
an analysis at the same level of detail as currently per- 
formed at z ~ 5 would require another six GOODS fields, 
for ^1200 arcmin 2 in total. 
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